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ABSTRACT 


The design of filters for detection and 
estimation in radar and communications systems is 
considered, with inequality constraints on the 
maximum output sidelobe levels. A constrained 
optimization problem in Hilbert space is formulated, 
incorporating the sidelobe constraints via a partial 
ordering of continuous functions. Generalized 
versions (in Hilbert space) of the Kuhn-Tucker and 
Duality Theorems allow the reduction of this problem 
to an unconstrained one in the dual space of regular 
Borel measures. A convergent algorithm is presented 
for computational solution of the dual problem. 



1. INTRODUCTION 


In many radar and communications systems, pulses of relatively long 
time duration are transmitted because of peak power limitations, and an 
operation known as "pulse compression" is performed at the receiver. This 
compression is roost commonly achieved with a matched filter [1-7], that is, 
by correlating the incoming signal with time- and/or frequency-shifted 
copies of the transmitted waveform, a technique which is well-known to be 
optimal with respect to various performance criteria. The presence of a 
signal is detected when the matched filter output exceeds a threshold value, 
and parameters such as time delay and (Doppler) frequency shift are estimated 

by locating the peak output in time and frequency. One example of such a 

. • ' . i 

system is a pulse-position modulation communication scheme [7-10], where 

information is coded in the time delays of individual pulses and then 

decoded by estimating the delay with a matched filter. Another is a linear 

FM, or "chirp" radar [2-4], where the frequencies of transmitted pulses are 

swept linearly with time and the receiver has a linear time delay vs. 

frequency characteristic of the opposite slope. 

If a signal s, non-zero on the interval (to, to+T), appears at the input 

* 

to a pulse-compression filter, the output typically consists of a main peak 
surrounded by sidelobes, as shown in Figure 1, and it is desirable in certain 
cases to reduce or restrict the height of these sidelobes. This becomes 


* 


In the case of a matched filter, this is simply the autocorrelation 
function of 8 . 


important, for instance, when a radar must distinguish among multiple 
targets, or when a pulse-position modulation system operates in an 
environment of severely degraded signal-to-noise ratio [8]. This paper 
is concerned with the design of optimal "mismatched" filters for detection 
and estimation, subject to inequality constraints on the sidelobe levels. 

Previous work in this area has been principally concerned with 
analogous problems of designing antenna arrays [2-3] or with discrimination 
against statistically-distributed clutter [2-5], an approach which is often 
equivalent to reducing the sidelobe energy [12]. Another method [9-11] 
involves constraining the sidelobes approximately, with inequalities at a 
finite set of times, and leads to an optimization problem in terms of 
ordinary differential equations with delays. 

In the next section attention is restricted to finding an optimal filter 
for detection , and a problem is formulated with the objective of 
maximizing the probability of detection subject to constraints on the 
sidelobe levels. Section 3 summarizes some results of constrained 
optimization theory in function space, which is applied in Section 4 to 
reduce the detection filter problem to an unconstrained minimization in the 
dual space. An algorithm for the solution of this dual problem is developed 
in Section 5. Finally, the problem of determining an optimal filter for 
estimation is formulated in Section 6, and its solution is seen to be 
analogous to that of the detection filter problem. 



2. Receiver model and formulation of detection filter optimization problem 

The receiver model to be considered is shown in Figure 2, its possible 
interpretations as part of a communication, radar, or other system [1-7J 
being left unspecified. The signal s is assumed to be a given real 
continuous function of time with support in [0,T] and unit energy, that is 


I M l 2 ■ | s 2 (t> dt 

— do 


T 

s 2 (t) dt “ 1 

0 


( 2 - 1 ) 


It appears at the input to the receiver, after a delay to, corrupted by 
zero-mean, stationary, gaussian white noise n with power spectral density 
No. The impulse response of the linear, time-invariant filter is to be 
determined, and is restricted to be square-integrable with support in [0,TJ. 
The two terms y and £ of the output, due respectively to the signal 
and noise, are given by 


y(t) 


r°° 

s(x-to)h (t-T) dx 

> 

— oo 


( 2 - 2 ) 


£(t) *» j°° n(t)h (t-x) dx 

— OO 


(2-3) 


and it is an easy matter to show that the output noise power is 


■f 


■r 


E{£ 2 (t) } = No h 2 (t-x) dx = No h 2 (x) dx for all t (2-4) 


where E denotes the expectation operator. 


• i-l 


It is convenient at this point to replace the filter impulse response 

'h 

h with an equivalent "receiver function" u defined by 

u(t) A h(T-t) , (2-5) 

shift the time origin to to + T, and consider the output of the filter 
to be the cross-correlation function \ p defined by 

iKO A y(t+t 0 + T) 

I 

s(T-to)h(t+to+ T-t) dx 


( 2 - 6 ) 

Because s and u both have support in [0,Tj, \p must have support 
in t-T,T] and 

T 

^(t) = u(r)s(T+t) dT = <u,s > (2-7) 

''O C 


* 

By an abuse of terminology, u will be referred to as "the filter." 


s (x+t)h(T— t) dT 

-OO 

s(T+t)u(T) dT 

-XX) 





2-3 


E (S 2 (t)} - No 


u (t) dT “ No J | u I J 2 for all t 


( 2 - 8 ) 


where s fc is the shifted time function defined by 


( s(T+ 

■‘ (T) A { o 


s(T+t) t € [0,T] 
otherwise 


(2-9) 


For this model it is well-known that the detection scheme which 
maximizes the probability of detection consists of processing the input 
with a matched filter (u = s) and comparing the output [iK0)+£(0)] to a 
threshold value. Assuming that the threshold is specified independently 
(often so as to ensure an acceptable "false alarm" probability), the 
detection probability depends only upon the output signal-to-noise ratio, in 
this case 1/No . 

More generally, if a threshold-type detection scheme is used with a 
filter which is not necessarily matched to the signal, the probability 
of detection (at t = 0) for this model is easily shown to depend monotonically 
upon the output signal-to-noise ratio 


(S/N) = (0) / E{£ 2 } = <u,s> 2 / N 0 | { u j (2-10) 

I 

The problem here will be to find a filter u which maximizes this 

irk 

value subject to the sidelobe constraint 

* 7 

All norms and inner products refer to L [0,T] unless labelled otherwise 
by a subscript » 

** ; 

It is a straightforward generalization to replace e by e(t) 
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max | \p(t) | < e | ^CO) | (2-11) 

6< | t i < T 

which is illustrated in Figure 3. This allows for a central lobe of width 
26 about t,® 0 and restricts the magnitude of iKO elsewhere to be less 
than a fraction e of ^(0), where 6 and £ are parameters to be chosen 
by the designer. 

It is implicit in the constraint (2-11) that the peak value of 
|tj>(t)'| in fact occurs at t = 0, an assumption which is justified intuitively 
by the fact that |^(0) | will be the performance index to be maximized. 

This in turn is justified by the observation that maximizing at any t f 0 
corresponds either to making a decision before the entire signal has arrived 
or to delaying the decision until some time after it has. 

It can be assumed with no loss of generality that 1^(0) = <u,s> > 0, 
since (2-10) and (2-11) are the same for both u and -u. But then the 
maximization of (2-10) subject to (2-11) is equivalent to maximizing 

<u/| I u I I ,s> (2-12) 

subject to the constraints 



2-5 


Next, note that scaling u has no effect on (2-13), so that 
maximizing (2-12) is equivalent to maximizing <u,s> over all u of unit 

energy (||u|| 2 « 1). In fact, it is also equivalent (and more convenient) 

to maximize <u,s> over the unit ball (u : ||u|| 2 <_ 1), since a nonzero 
solution of this latter problem will clearly have ||uj{ 2 * 1. Thus, the 
detection filter optimization problem has assumed the following form: 

maximize <u,s> , u£L 2 [0, T] 

subject to <u,s t > <_ e <u,s> , <5 £|tj<. T 

- <u, s > <_ e <u,s> , 6 <Jt|<_ T 

Ifujl 2 < 1 




3. Constrained optimization theory 


In this section some results from the theory of constrained 
minimization in normed linear spaces are stated, and in the next section 
they will be applied to the filter optimization problem just formulated. 

This presentation follows Luenberger [13], but the theory may also be found 
in Hurwicz [14] and elsewhere. 

Let U and Z^, i = 1, 2 , • • • n be real normed linear spaces, with 

convex cones P* C Z^, i = 1,2, ••• n, Each cone P* generates a 
■)* 

partial ordering on Z^, denoted ^ and defined by 

x 4 y <=> y-x £ P+ , x,y ez i (3-1) 

4 * 4 * 

P^ is called the positive cone of Z^, since P^ « (zCZ^: z ^ 0}, 

and the notation x •< y indicates that y-x is an interior point of P*. 

0 * 

The dual positive cone P^^ in the dual space Z^ is defined by 
* 

Pj® A [z* £ Z^* : z • z ^ 0 V z £ } (3-2) 

The function G^:U -*• Z^ is said to be convex (with respect to the partial 
ordering ^ ) if 

G i lau+(l-a)v] < aG i (u) + (l-a)G i (v), u,v€U, a€[0, 1] (3-3) 

■f 

A partial ordering is defined as in Dunford and Schwartz [15] to be a 
transitive and reflexive but not necessarily antisymmetric relation. Thus 
x^y and y x» may not imply that x = y (i.e. Z^ may not be well-ordered). 



If the ordering sign is reversed in (3-3) , then the function is said 
to be concave . Note that if : U is convex in the sense of (3-3) 

g* if . 

and C 0)» then the functional g^: V ■* R* defined 

gf (u) ■ P 1 *G 1 (u), u € U 


is convex in the ordinary sense. 

Consider the optimization problem 

minimize f(u) , u.CU 

subject to G^u) ^ 0 , i“l, 2, 

where f : U •+ R 1 and G i : U -*• Z ± , i=l, 

* * 

Lagrangian functional L : U » Zi x Zj x 
def ined by 


by 

(3-4) 



2, n are all convex. The 

* i 

•••• * Z n R for problem (3-5) is 


A n 

L(u ; PI, P 2 , ••• p ) =■ f(u) + l p • G (u) (3-6) 

i=l 

+ * 

If C , i * 1, 2, ••• n, are all given then 

L(* ; pi, pz, ••• p^) is a convex functional on U. 

The dual functional f : Z] x z* x ••• x z^ + R 1 for problem (3-5) is 
defined by 

<P Cpi, P2> ••• p_) = inf L(u; pj, p 2 , ••• p ) , (3-7) 

■ u€U 

and It is easily verified [13,12] that 4> is a concave functionals 


i 



Using these definitions, the following theorems apply to 
problem' (3-5) 

Theorem 3.1 : (Generalized Kuhn-Tucker conditions) For i - 1, 2, ••• n;- 

let U, be real normed linear spaces, as above, with each of the 
positive cones G Z^ having a non-empty interior. Let f:U R 1 

and G^: U -*• Z^ be convex, as above, and assume the existence of u € U 
such that G^(u) -< 0 for all i. 

Then jLf u° e U solves the problem 
minimize f(u), u «U 

subject to G ± (u) < 0, i=l,2 

there must exist Lagrange multipliers 0 ^ p“ £ , i=>l, 2, ••• n, such that 

(a) u° solves the problem 

minimize L(u ; p® , p ® , • • • p° ) , u £ U (3-9) 

where L is the convex Lagrangian functional defined in (3-6) 

(b) pj. * G i (u°) = 0 , i-i, 2, ••• n * (3-10) 

Proof : See [13, p.217], where it is proved for a single inequality 
constraint G(u) ^ 0 in the space Z with positive cone P + . 

This theorem becomes equivalent if one identifies Z as the Cartesian 
product space 

Z = Zi x Z 2 x "• x Z n ) (3-11) 



G:U ■* Z as the mapping 


G(u) “ [Gi(u), G 2 (u), ••• G^(u) ] , 


(3-12) 


and the primal and dual positive cones as 


4 * 4 * 4 - 4 * 

P = Pi X P 2 X ••• X P 

n 

ffi ffi © 

P a Pi X P 2 X • • • X P 


(3-13) 


The details are straightforward and will be omitted. 


The above theorem generalizes the well-known Kuhn-Tucker conditions 
for convex nonlinear programming problems in finite-dimensional spaces. 

Condition (3-10) expresses the fact that a Lagrange multiplier is non-zero 
only where the corresponding constraint is "active". The following duality 
theorem provides a method for finding the Lagrange multipliers. 

Theorem 3.2 : (Duality) In Theorem 3.1, the Lagrange multipliers 

o * 

p“ £ , i-1, 2, ••• n, solve the problem { 

maximize <f>(pl» P 2 » *** P ) ") 

n Y (3-14) 

subject to p^ ^ 0 , i=l, 2, ••• n J 

and 

min f( u ) = max 4>(pi, P 2 , ••• p ) (3-15) 

G^uHO ' ■ p^O n 

where <p is the concave dual functional defined in (3-7). Moreover, 
conditions (a) and (b) of Theorem 3.1 hold for any pj, p 2 , ■••• p n which 
solve (3-14). 



Proof : See [13, p.224]. The last statement follows by examining 

the proof [13, p.218J of Theorem 3.1, which consists of showing 
the existence of a hyperplane, defined by p®, p 2 °, ••• p^ , 
which separates two sets in R 1 x Z\ x Z 2 x ••• x 2 , It is easily 
verified that any pi, p 2 , ••• p n which solve (3-14) also define 
a separating hyperplane, and hence must yield u® through (3-9). 



4. Application to detection filter optimization problem 


(4-1) 


(4-2) 


The fact that Gi and G 2 map L z functions into continuous functions 
is easily established by noting that the translation defined in (2-9) is a 
uniformly continuous mapping from R 1 to L z [-<», <»] [16, p.183]. The four 

mappings in (4-2) are all convex, the first three by virtue of their 
linearity and the last because the norm is convex. 

The detection filter optimization problem (2-14) can now be written as 


The optimization theory of the previous section will now be applied 
to the detection filter problem formulated in Section 2. In order to 
consider (2-14) in the form (3-5) let 


U = L 2 [0, T] 

Zi“ Z 2 - C[-T, T] 
Z 3 = R l 


1 


and define f, Gi, G 2 , G 3 as follows: 


f(u) ■ — <u , s> 

[ G 1 (u) ] (t) ■= <u, s t > - £ <u, s> , te[-T, tj 
[G 2 (u) ] (t) = - <u, s t > - £ <u, s> , t£[-T, t] 


> 


G 3 (u) 


- 1 


minimize f(u) , u £ L z [0, T] 
subject to G i (u) 0 , i=l, 2, 3 



(4-3) 


where the ordering in Z 3 = R 1 is the natural one (i.e. P 3 + = R + , the 
nonnegative reals, and the partial ordering ^ in Zi = Z 2 = C[-T, T] 
is generated by the positive cone 





Pl + - P 2 + - P c + “ (x e C[-T, T] : x(t) > 0, 6 < |t| < T}, (4-4) 


In other words, 


x y <" . > x ( t) < y (t) , 6 < |t| < T 


(4-5) 


It is easy to verify that P is a closed, convex cone with 

v 

•j* 

nonempty interior [12]. 

A 

The dual space C [-T, T] is well-known [15,16] to be represented 
by M[-T, T] , the space of regular Borel measures on [-T, T], where the 

A 

linear functional 4> € C [-T, T] corresponding to m € M [-T, T] is 
obtained by Lebesgue integration. 


<j> • x 


x dm , x £ C[~T, T] 


(4-6) 


-T 


The norm of a measure in M[-T, T] is given by its total variation. 


n 


I l m l = SU P I I ™( E ^) I » m G M[-T, T] 
i=l 


(4-7a) 


where the supremum is taken over all Borel partitions { E . } of [-T, T]. In the 

1 i=l 

case of a positive measure p (i.e. p(E) > 0 for E £ [-T, T] , the 
norm is 


IIpIIm = pU-t, t]) = j 1 dp 

-T 


(4-7b) 


A non-empty interior is essential, since the proof of Theorem 1 relieB on the 
Separation Theorem. The corresponding positive cones in I.P [-T, T] , 1 < p < ■», 
have no interior points. 



The dual positive cone consists of positive measures which are 2 ero 
on (-6, 6), as follows: 

Lemma 4.1 : If the positive cone P^ + Q C[-T, T] is given by (4-4), 

then the dual positive cone £ M[-T, T] is 


P„® -{m£ M[ -T, T] : m(E) > 0 , E £ [-T, T] 
and m(F) - 0 , F £ (-6, 6)} 


Proof : Suppose x £ P , so that x(t) > 0 for 6 < |t | < T, and 


© 


m € P M as defined by (4-8). Then it is clear that 

-6 

( 

x dm > 0 . , 


x dm — 

- 

x dm + 

J 

; 

-T -T 6 


so that m must belong to the dual positive cone as defined by (3-2). 

Conversely, suppose m belongs to the dual positive cone. If 
m (E) < 0 for any EC [-T, T] , then by Urysohn's Lemma [15,16] there 
must exist x e P^ + such that 


x dm < 0 , 

-T 

which contradicts (3-2). Similarly, if m(F) 0 for any FS(-5, 6) , 
then there exists y € C[-T, T] , with support in (-6, 6), such that 

T 

j y dm 0 
-T 


(4-8) 


(4-9) 


- 10 ) 


\ 


(4-11) 



this also contradicts (3-2) 


Since y and -y both belong to 

0 

Thus m must belong to . 

' Q.E.D . 

In order to apply ( Theorem 3.1, it is necessary to assume the 
existence of a feasible solution which satisfies the inequalities strictly: 

Assumption 4.2 : There exists u e L 2 [0, T) such that 

G ± (u) -< 0 , i-1, 2, 3 ' (4-12) 

Because of the nature of the sidelobe constraints (2-11), one 
expects that if e or 6 is chosen too small, there will be no non-zero 
feasible solutions at all. On the other hand, if there is a feasible 
solution u, i.e. 

G i (u) 0 , i=l, 2, 3, (4-13) 

then it is clear from the definition (4-2) that an infinitesimal increase 
in e will allow %u to satisfy (4-12). Thus Assumption 4.2 is not a 
particularly strong additional restriction. With this assumption, the 
existence and uniqueness of a nontrivial optimal solution may be established: 

Theorem 4.3 : Under Assumption 4.2, a unique nonzero optimal solution 

exists for the detection filter optimization problem (4-3). 


Proof : The problem is to minimize f on the constraint set 

S “ {u € L 2 [0, T] : G^u) 0, i«l, 2, 3} 

- Gl 1 (-P ( , + )n Gl 1 (-P c + ) n (u : | |u| | < 1 > (4-14) 

Note that S {0} by Assumption 4.2. Since -P^ + is closed and convex, 
the continuity and linearity of Gi and G 2 imply that Gi^(-P^, + ) and 
G 2 1 (-P r + ) are also closed and convex, and the intersection S of these 
two sets with the unit ball {u : |]u|| < 1} is closed, convex, and 
bounded. But this implies that S is weakly closed^, and, L 2 [0, T] 
being reflexive, Alaoglu’s Theorem^ says that it is weakly compact. The 
continuous linear functional f is also weakly continuous^, so f(S) is 
compact in R 1 and must contain its infimum, which establishes the 
existence of an optimal u° € S. 

Now suppose that u 1 C S is also optimal. Assumption 4.2 implies 
that u° and u l are non-zero, so that ||u°|| =» | ju 1 | | ■=> 1, according to 
the discussion preceding (2-14). The element u = ku° + ^u 1 satisfies 
the constraints because they are convex, and the linearity of f implies 
that u is also optimal: 

f(u) «* f(^u° + Hu 1 ) 

= ^f(u°) + %f(u J ) = f(u°) = f(u J ) (4-15) 

Thus u must also have unit norm. 


t 


See [15], pp. 422-424 


2 


l u l 1 2 “ I l^ u ° + | | 2 “ M 1°° 1 1 2 + % < u °« u * > + *1 l u MI 


m \ + \ < u°, u 1 > = 1 (4-16) 

so that the Schwartz inequality becomes an equality: 

< u°, u 1 > « 1 

This implies that u 1 => u° so that the optimal u° must be unique. 

Q.E.D . 


Necessary conditions and dual problem 

Theorem 3.1 may now be applied to the detection filter optimization 
problem (4-3) . The spaces (4-1) and positive cones (4-4) are as 
specified by the theorem, and Assumption 4.2 asserts the existence of a 
feasible solution which satisfies the constraints with strict inequality. 
Theorem 4.3 guarantees the existence of a unique optimal solution u°, 
so by Theorem 3.1 there must exist nonhegative Lagrange multipliers 

pi°, pa 0 e ^ m[-t, t) 

P3° € R + 


(4-17) 


such that 


(a) u° minimizes 


L(u; pi 0 , P 2 °, p3°) “ f(u) +1 p 1 ° • G i (u) 


i=l 


a — <u 


1 

,8> + j [<U,8 t > - £ <U,S>]dpi°(t) 

-T 


1 

+ | t-<u,8 t > - e <U,S>}dp 2 ° (t) + P3 0 [ I 1 VX 1 1 2 - 1] 

-T 


over all u € L 2 [0, T] , and 


(b) pi 0 * Gi (u°) = 


P 2 ° 4 G 2 (u°) 


-T 

f T 

« 

-T 


[<u°, 8 fc > - e <u°, 8>]dpi°(t) = 0 


[-<u°, 8 fc > -£<U°, S>]dp2°(t) “ 0 


P 3 ° • G 3 (U°) = P3°[ j |u°( I 2 - 1] 


Changing the order of integration and rearranging terms in (4-18) 


L(u; pi, p 2 , p 3 ) ° - <u, u(pi, p 2 )> + P 3 j I ^ | 2 “ P3 


where u (pj, p 2 ) £ L 2 [0, T] is defined by 


T T 

u(pi, p 2 ) = s + (-s t + e s)dpj(t) + (s- + e s)dp 2 (t) 


-T 

T 


-T 


= s - j s t dpi (t) + j s t dp 2 (t) + e s[ 
-T -T 


dpi(t) + 


-T 


(4-18) 

(4-19) 

(4-20) 

yields 

(4-21 

T 

dp 2 (t) ] 

. 

-T 


t 


Since s and s t have been assumed continuous on [-T, T] , this is a trivial 
application of Fubini's Theorem [15, 16], 




4-B 


■ s[l + e pi 



8 t dpi (t) + 


1 


S t dp2 (t) 


(4-22) 


If p 3 ° = 0, then either L(u; pi°, P 2 °» P 3 °) has a minimum value 
of - 00 or u(pi°, p 2 °) " 0 , but in the latter case the duality theorem 
implies that f(u°) «* 0 , contradicting the existence of a unique non-zero 
optimal solution. Thus it suffices to consider only p 3 > 0, and 
completing the square in (4-21) yields 


L(u; pi, P 2 , pa) - P3 1 j u - u(pj, p 2 )/2p 3 j| 2 - p 3 “ |.|u(pl» P 2 ) | 1 2 /4p 3 

(4-23) 


This is minimized over u by 


u *» u(pi, P 2 ) / 2 p 3 


so that the dual functional is 


4>(pi, P2, P3) = - P3 - j|u(pi, P2)|| 2 /4p 3 


(4-24) 


(4-25) 


for Pi, P 2 ^ 0 and p 3 >0. 


According to Theorem 3.2, the Lagrange multipliers pi 0 , P 2 °, P 3 ° 
for the primal problem may be found by solving the dual problem of 
maximizing 4> over pi, p 2 , P 3 ^ 0. It has already been argued that 
p 3 0 > 0 and u(pi°, p 2 °) £ 0 , so the maximization over p 3 for 


1 


any pi, P 2 is easily accomplished by differentiation: 

d<j> = - 1 + ||u(pi, P 2 ) I } 2 /4pa 2 a 0 
dp 3 

P 3 ° ( P 1 t P 2 ) “ ^||u(pi, p 2 ) | | > 0 

This must be a maximum, since 

d 2 (ft - - ||u(pi, P 2 > 1 | 2 / 2 p 3 3 

dp 3 2 

- 4/| |u(pi, p 2 ) | | < 0 

Substituting (4-r26) into (4-25) , the dual problem reduces to one of 
maximizing 

<J>[Pl» P2. P3°<Pl, P 2 ) ] » - ||G(Pl. P 2 ) | | 

\ 

fc 

over pi, P 2 ^ 0 or equivalently, of minimizing 

T T 

I | u (pi » pa) | | = | |s[l + e| Jpi | ( M + e| | p 2 | | M ] - j s t dpi(t) + 

-T -T 

over p x , p 2 e P M ® S M[-T, T] . 


(4-26) 


(4-27) 


(4-28) 


s t dp 2 (t) | | 2 
(4-29) 


* The quantity to be minimized is squared for convenience in taking 
differentials. . 


Unconstrained minimization 


This constrained minimization problem will now be reduced to an 
unconstrained one by combining pi. and p 2 into a single real measure. 

Recalling Lemma 4.1, the constraint pi, p 2 € means that pi and p 2 

are positive measures on [-T, T] which are zero on (-6, 6). But the sidelobe 
constraint (2-13) implies that the integrands in the necessary conditions 
(4-19) are nonpositive, so that in each case .the measure must be zero wherever 
the integrand is non-zero. Put another way, the 

Lagrange multipliers pi and p 2 are nonzero only where their respective 
constraints are active. Since the sidelobe constraints (2-13) cannot be 
simultaneously active, the measures pi and p 2 must be mutually singular 
[15, 16], that is, each must be zero wherever the other is not. 

Consider the real measure 

P “ Pi - P2 j (4-30) 

which belongs to the subspace of M[-T, T] defined by 

M 0 [-T, T] = (m G M[-T, T] : m(E) = 0, ES(-6, 6)}, (4-31) 

since both pi and p 2 are zero on (-6, 6). 

Because of the mutual singularity of pi and p 2 , the norm of p is given by 


H-AA 


and u(pi, p 2 ) in (4-22) can be written as 


j 

u(Pi* P 2 ) " “(p) - s + ej|p|| M s " | 8 t d P(t) 


(4-33) 


-T 


Moreover, it follows from the Hahn Decomposition Theorem [15, 16] that 

every element in Mo[-T, T] is uniquely decomposable into two such 

mutually singular measures. Therefore, minimizing ||u(pj, p 2 )|| 2 over 
© 


Pi» P2 £ is completely equivalent to minimizing ||u(p)|| over 

p € M 0 [-T, T] . 

* 

The original problem (2-14) has thus been reduced to an unconstrained 
dual problem: 


minimize n(p) « ||u(p)|| 2 , p£M 0 [-T, T] 


where u(p) = s + ejlpj^s - s t dp(t) 


-T 


(4-34) 


J 


Once p° is found to minimize Theorems 3.1 and 3.2 establish 
that the optimal solution of (2-14) is 


u° = u(p°)/l jG(p°) | 


(4-35) 


The stipulation that p £ M 0 [-T, T] is not really a constraint, since this 
subspace is in fact equivalent to the measure space M{ [-T,-6] [6,T]} . 



b(t) +e||p°||s(T) - j s(T+t)dp°(t) 

u°(T) - ^ » T€[0,T] (4-36) 

| |s + e| | p° | | s - | s t dp°(t) | | 

-T 

Note that the denominator is nonzero in (4-35) because 

<u°,s>»-f(u 0 ) =-<J>( P i°, p 2 °. P3°) *= n^(p°) » | |u(p°)H (4-37) 

by (3-15), and Assumption 4.2 implies that this quantity will be > 0. 

The duality exhibited in (4-37) also provides a measure of optimality 
for approximate solutions, since 

<u,s> < <u°,s> '*» J | G (p° ) | | < | |u(p) | | (4-38) 

where p and u are any subop timal solutions of the dual and primal 
problems, respectively. Thus, a filter 

u 1 = u(p 1 )/| lutp 1 ) | | (4-39) 

may be considered "approximately optimal" if it satisfies the constraints 
(2-13) to within some acceptable tolerance ei and 

||u(p 1 )|J - <u I ,8> < e 2 (4-40) 


for some tolerance e 2 • 
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5. Computational solution of dual problem 

In the previous section the detection filter optimization problem 
was reduced to an unconstrained dual problem (4-34). An iterative 
scheme will now be developed for its solution. 

In order to utilize a digital computer for the solution, it is 
necessary to "discretize" the measure space Mo[-T,T] « M{[-T,-6]U[6,T] } 
defined in (4-31). This will be done by partitioning the time set into 
intervals (t^ , where 


f-T + jA n , j “ 0, 1, 2, ... n 




6 + (j-n-l)A n> j b» n+1, n+2, ... 2n+l 


(5-1) 


and 


A n 2 (T-«)/n 


(5-2) 


Next, define the finite-dimensional subspace 


fT X 

Mo [~T,T] = (p£ M 0 : [ x(t)dp(t) => f x(t)p(t)dt, x€ C[-T,T]} (5-3) 

J — T J _T 


where p(t) (the Radon-Nikodym derivative of the measure p) has the 
form 


P(t) - Pj , t £ (t jt t j+1 ], j - 0, 1, 2, ... 2n (5-4) 
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and p n ■ 0 so that 

p(t) - 0, t £ (t n , t n+1 l - (-6,6] 


(5-5) 


This is illustrated in Figure 4. 

In other words, a measure in M^[-T,T] is represented by a 

“staircase" function (its derivative) which is constant on each of the 

2n+l intervals of length A . Such a measure is completely specified 

2n n 

by the weights (p , ) , and 

3 j=0 

(T 2n ft . 

x(t)dp(t) = I p, 3 x(t)dt (5-6) 

-T j=0 JJ t^ 


for any x £ C[-T,T]. Its norm is given by 



(5-7) 


It will now be shown that for large enough n, the approximating 

subspace always contains near-optimum elements. An equivalent statement 

00 

is that the subspace U [-T,T] is dense in the weak-star topology 

n=l 

[15] of M 0 [-T,T] . 


Lernta 5. 1 To every e > 0 there corresponds N such that 
for n > N, M^[-T,T] contains an element p n for which 

|u n (t) - u°(t)| < e, T £ [0, T] (5-8) 


1 







5.3 


and 

n(p n ) - n(p°) < e (5-9) 

where u° is given by (4-35) and 

u n *> u(p n )/| |u(p n ) | i (5-10) 

Proof : A measure p n € Mg[-T,T] will be constructed such that 

|[fi(p°)](T) - fu(p n ) ] (x)| < e|s(T)| | ||p°|l M - l|p n H M | 

iji fji 

+ |j s t (T)dp°(t) - j s t (T)dp n (t)| < e, T £ [0,T] (5-11) 

for large enough n. The last integral may be expressed for any fixed x 

as a limit of integrals of a sequence of simple functions [15, 16] which 

approach 8 t (x). Because s is continuous, such a sequence of simple 
n 00 

functions ^ 8 t ^x)} n _ 1 is defined by 

( fc j+l 

s"(T) - J B^OOdS, t e (tj, t j+1 ], j-0, 1,2, ...2n (5-12) 

where the intervals (tj , t^ + ^] were defined in (5-1). Thus there must 
exist an n (independent of T, because of the continuity of s) such 


that 
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|j 8 t (T)dp°(t) - J ^ s"(T)dp°(t)| 

fT 2n ^j+l i 

- I - l P°(£t.,t ]) 8.(T)dt| < e/2,T € {0, T] 

J -T ■ j-0 J 3 

J (5-13) 


Referring to (5-4) and (5-6), a measure p n £ H^[-T,T] may now be 
specified by choosing 

p” ■ P°«tj,.t +1 1X j - 0, 1, 2, ... 2n (5-14) 


so that 


ij 1 s t (T)dp°(t) - | T s t (T)dp n (t)| < e/2, T £ [0 ,T] (5-15) 


From the definition (4-7a) of the norm in M[-T,T], it is clear 


that 


2n 

4 n .1 IpJI 


j=0 




'M 


(5-16) 


If equality in (5-16) is not approached with increasing n, then 
1 ( p n | | ^ can be increased, for instance by adding to p n a measure whose 
weight is y in (t k> t k+1 ], "Y in (t k+1 , t k+2 l» and zero elsewhere 
for some integer k < 2n and scalar y. Such a perturbation can be used to 


make 




»<T)| l|potl M - IIp"II m | < T6 10.T) <5-17) 

and the continuity of s ensures that for large enough n, (5-15) 
remains valid. ' 

This establishes (5-11), from which (5-8) and (5-9) follow 
immediately. 

Q.E.D. 


Steepest Descent Algorithm 

A steepest descent algorithm will now be proposed for minimizing 
n(p) in the subspace m’oI-T.T]. Lemma 5.1 establishes that the true 
optimum in Mo[-T,T] can be approached as closely as desired by increasing 
n. The functional to be minimized is 

n(p) - | |s + e( |p| | m s - j s t dp(t)| | 2 
2n 2n f^j+l 

a I I s + I IpJs + l p. J s dt|j 2 (5-18) V 

n j °o J J-0 2 tj 

(recall that (t^, t = (—6 , 6 ] and p^ =* 0) . 

The functional (5-18) is not Frechet (strong sense) differentiable 
because it contains a term 


P 


M 



(5-19) 


This norm does, however, possess a directional Gateaux (weak sense) 
differential [13, 17, 18] which is convex (but not always linear) in 
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its increment. This differential at p in the direction h is defined 
as 


6 + ||p; h|| M - lim £[||p + cth|| - MpIU, p. h € M?[-T,T] . (5-20) 

crK) + 


For each j , . it is easily verified that 


6 + |Pj ; h jl “I 


V x j > o 

|hj | » Xj - 0 

V. ” h j * < 0 


(5-21) 


from which it follows that 


6+ l Ip; h l l M “ A n l + h -j 
jes J 


a i h, + a y |h, 

n jes J n jes 0 V 


(5-22) 


where 


S + = {j : > 0} 

S = { j : Pj < 0) 


.0 A 


(j; Pj » 0} 


and S + USUS° - (0, 1, 2, ... 2n} 


(5-23) 

(5-24) 

(5-25) 

(5-26) 


It is a straightforward matter to show that the differential of (5-18) 


is 


1 
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6 + n(p; h) 


. 2n f J +1 

2 < [s + e| |p| lyB - ^ PjJ tj 8 t dt1 ' 

te« + l|pi hll^ - J o h i |^ 1+1 s T dT])> 


Interchanging integrals, this becomes 


6 n(p; h) 


in r j+1 . 

2e[l + ej ( P I L- l P <8,s >dt]6 ||p;h[| 
j=0 J; t j 

-2(1 + e|lpll M ) 2 ? \ I i+1 <s,s t >dt 


2 ? 2 ? h f t j+ 1 f t i+l 

4 n P ^ hjL J J <s. ,s >dtdT 

j=0 i=0 J j / t 1 t* t 


+ 2 


2n 


i *•“ 

K6 I |p; h| | M + l h i L i 
i=0 


where 


K ■ 2e[l + e| |p|| - l P, f J+1 <s,s >dt] 

j=0 J J tj 

^ 2[1 p | tl+1 <8 ,8 >dtdT 

j=0 2 J tj J t ± T 


- (1 + e 


M 


) 


'i+1 


<s,s t >dt], i=0, 1, ... 2n 


(5-27) 


(5-28) 


(5-29) 


(5-30) 


and ||pll M is given by (5-19). 

Finally, substitution of (5-22) yields 



The object is now to iteratively decrease the cost H(p) by choosing 
a sequence of increments h for which the differential 6 + r)(p ;h) is negative. 
Suppose for a moment that S° is empty, so that (5-31) is a linear 
functional of h, 


6 + n(p; h) 


+ - N j h j 

jesus J J 


(5-32) 


where Nj - Lj + KA r for j £ S + and j £ S~ , respectively. 

Then -6 n(p; h) will be maximized over all h of unit norm whenever h 
is aligned* with it. This means that h^ should be nonzero only where 
| N j | is a maximum, and opposite in sign to . 

The generalization to non-empty S° is straightforward if one notes 
that the last term in (5-31) can be written for any S C S° as 


i s Vj + w* i 


T (L. + KA )h . if h > 0 on S 
j£S j n i j " 


J (L - KA )h. if h < 0 on S 

jis * n j j " 


(5-33) 


An element x£.X and a functional f £ X* must satisfy the inequality 
f*x < ||f|| ||x||, by the definition of f. The functional f is said 
to be aligned with x whenever equality is achieved. 
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On S°, then, an "aligned" h will have h^ nonzero only where 

(-L. - KA ) or (L. - KA ) is positive and a maximum, its sign being 
j n j n 

positive or negative, respectively. 

To summarize, an "aligned" h £ M^[-T,T), resulting in a maximally 
negative cost differential 6 + n(p; h) , will have h^ ■ 0 for j ^ R, 
where* RC (0, 1, 2, ... n-1, nfl, ... 2n} is the set of integers for 
which the maximum 

M ■ max {0, max |l, + KA | , max |l. - KA | , 
jeS+ 3 jes- 3 

max [-L, - KA ] , max [L. - KA ] } (5-34) 

j€S° 1 " j€S° 3 

is achieved. For j £ R, the sign of h^ will be opposite to 

(Lj + KA^) or (L^ - KA n ) , whichever applies. The magnitudes |hj | , 

j £ R will be chosen equal for simplicity, although they could be 

optimized if desired. This leaves only the "step size" ][ |h | to be 

j£R 3 

adjusted at each iteration. 

Whenever M > 0, there will exist an increment h for which 
<5 + ri(p; h) < 0 and hence n can be decreased. On the other hand, if 
M = 0 for some p°, then 

6 + n(p°; h) > 0 for all h (5-35) 

* Recall that by the definition of m£[-T,T], p and h are both zero 

on (-6,6), which implies that p => h =0. 

n n 


•> * JLV 


and H must have a minimum at p°. 

Moreover, the second differential can be calculated from (5-28), 


6 n(p;h,g) 

- 2£[1+ e| |p 


M - l P 1 f i+1 <a,s >dt]6 + ||p;h,g|| 

M j-0 j • 

+ l Ip ; s I l/l Ip ; h | l M + 2 1 1 f i+1 <s t 

j=0 i=0 J tj J t A 

- 2e l f J+1< s,s t >dt[g 6 + | |p ;h| ) M + h 6 + | jp;g| | M 1 
. 1=0 } t A J J 


+ 2e 2 6 


,s T >dtdr 


For any J , it is easy to verify that 


6 IPji<y 8 jl 


if p^ = 0, h^gj < 0 


0 otherwise 


2 

Thus 6 | | p ; h.gjlj^ is not defined for all h and g, but 

6 +Z | |p; h,h| | M = 0 for all h € M?[-T,T] 


(5-36) 


(5-37) 


(5-38) 


Substituting this into (5-36) with h = g and again rearranging integrals 
yields 

2n r t , 

1 

t 

1 = 


6 + n(p;h,h) = 2| |e6 + | lp;hj |s - £h ^ +1 sdt|| 

1=0 JJ t. 


> 0 for all p, h € M^[-T,T] 


(5-39) 
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+2 

This establishes that 6 r> (p ; h,h) is nonnegative definite for all p, 
so H must be convex and have no relative minima oh M^-T.T], The 
algorithm will decrease r| at every iteration and hence approach the 
minimum . 

The steepest descent algorithm described above will produce a 
sequence of measures p* £ Mo[-T,T], i = 1, 2, ... for which the cost 
nCp 1 ) approaches a minimum on this subspace. Lemma 5.1 verifies that 
by increasing n (i.e. using finer partitions of [-T,T]), this cost 
may be made to approach the optimal cost n(p°)» p° £ Mo[-T,T]. Moreover, 

u 1 » uCp 1 )/ | luCp 1 ) | | (5-40) 


approaches the optimal filter u°. 

Recall from the end of Section 4 that duality provides a criterion for 
stopping the algorithm: the constraints (2-13) should satisfy 


<u »s t > - e<u ,s> < ei 


<u*,s t > - e<u*,s> < ei 


6 < t < T 


(5-41) 


for some tolerance ei , and the primal and dual costs should satisfy (4-40) , 


I I^P 1 ) | | ~ <u i ,8> < e 2 


(5-42) 


for some other tolerance ej. 
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6. Estimation filter optimization problem 

Sections 2-5 are concerned with determining an optimal filter for 
detection of a signal, by maximizing the signal-to-noise ratio (2-10) 
subject to the sidelobe constraint (2-11). 

Another task which radar and communications systems must usually 
perform is the estimation of unknown signal parameters, particularly 
the arrival time to of the signal. In a radar system the signal delay 
is proportional to the range of the target. In certain communications 
systems, such as Pulse Position Modulation (PPM), the signal delay 
carries some portion of the transmitted information. It is well-known 
[1] that for a given continuously differentiable signal, the maximum- 
likelihood estimate of unknown time delay is obtained by processing 
the input with a matched filter and subtracting T from the time at 
which the output achieves a maximum. 

More generally, if this same estimation scheme is used with a 
filter which is not necessarily matched to the signal, the accuracy of 
the estimate has been determined by McAulay and Johnson [11]: 

Lemma 6.1 Assuming a large signal-to-noise ratio a priori, 

the above estimate of signal arrival time has a bias proportional 

to 

<j>(0) = <u,s> «* - <u, 8> (6-1) 


) 
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If this bias is zero, the variance of the estimate is the 
reciprocal of 

J 2 (0)/E{£ 2 } - <u, s> 2 /No| |u| | 2 (6-2) 

which may be interpreted as an output signal bandwidth-to-noise 
bandwidth ratio. 


Proof : Using the same assumptions as in Section 2, with the 

additional provisos that the signal s is twice differentiable and the 
signal-to-noise ratio is large, an optimal filter for estimating signal 
arrival time may be sought. The problem will be to choose a filter u 
so that the estimate is unbiased and its variance (6-2) is minimized, 
subject to the sidelobe constraint (2-11). 

The argument which led to (2-14) may be used [12] to put this 
estimation filter optimization problem in the following form: 


maximize 
subject to 


<u, s>, u£ L 2 [0,T] 
<u, s> = 0 


<U, B fc > < £<U, 8>, 6 < |t| < T 

- <u, s fc > < e<u, s>, 6 < |t| < T 

! lull 2 < i 


>» 


i 


(6-3) 


The requirement of zero bias can be dropped, but then the expression 
for the variance becomes very complex. 
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The results of Sections 3 and 4 may be used, mutatis mutandis , 
to reduce (6-3) to an unconstrained dual problem analogous to (4-34), 

minimize v(p) - | | v(p) | | 2 , p£M 0 [-T,T] 


(6-4) 


f T 

v(p) - a + e| |p| Lr 0 - r dp(t) 

J -t 


The function r t is defined by 


r t (T) - S t (T) + <S t , s> 8(T) 


l 


S t (x) 


s(a + t)da 


(6-5) 


Once p° is found to minimize v, the optimal solution of (6-3) is given 
by 


u° » v(p°)/| |v(p°) | | , i (6-6) 

and the duality theorem implies that 

<u°, s> » \^(p°) “ | !<?(p°) | j (6-7) 

The algorithm of Section 5 for minimizing r) is equally suited to 
minimizing v; the function u(p) in (4-34) need only be replaced by 
v(p) in (6-4). 

* Theorems 3.1 and 3.2 must be extended slightly to account for the 
equality constraint <u, s> = 0. See [12] for details. 
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